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Abstract 

A recent Letter in Physical Review Letters, "Contextual Values of 
Observables in Quantum Measurements", by J. Dressel, S. Agarwal, and 
A. N. Jordan 1 (abbreviated DAJ below), introduces the concept of 
"contextual values" and claims that it leads to "a natural definition of a 
general conditioned average that converges uniquely to the quantum weak 
value in the minimal disturbance limit". However, they do not define 
"minimal disturbance limit" . The present paper is in part the saga of my 
search for a definition of "minimal disturbance limit" under which this 
claim could be proved. The search finally ended with what is probably a 
definitive counterexample to the claim. 

In addition, the present work analyzes the arguments of DAJ in detail 
and relates them to traditional weak value theory. Various gaps and 
possible errors in its arguments are explicitly noted. 

1 Introduction for Versions 4 and 5 

This Introduction assumes that the reader is at least vaguely familiar with 
Versions 2 and 3. The next section gives a more leisurely introduction reprinted 
from Version 2. 

This paper has gone through a number of incarnations. Version 1 and the 
nearly identical Version 2 formulated what seemed at the time a reasonable 
definition of DAJ'f0 undefined "minimal disturbance limit" and presented a 
counterexample to its claim that their "general conditioned average . . . converges 
uniquely to the quantum weak value in the minimal disturbance limit" , assuming 
this definition. 



*For contact information, go to| http: / /www. math. umb.edu/~sp| 
^Abbreviation for Dressel, Agarwal, and Jordan's paper 
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Later, I learned that the my definition did not correspond to the authors'. 
(For the strange story of how I learned this, see the "Afterword" section below.) 
My definition corresponded to what would typically be called "weak" measure- 
ments, which are not the same as the "minimally disturbing measurements" 
discussed just below. 

DAJ was assuming that the quantum measurements were slightly more gen- 
eral than what the the book of Wiseman and Milburn ([H], Section 1.4.2) calls 
"minimally disturbing measurements" . These are defined as measurements for 
which the measurement operators are positive. (They do not correspond to 
what one might guess given only the normal associations of the English phrase 
"minimally disturbing measurements" ; see the Afterword to Version 3 for more 
information.) 

There is no limit in this definition, so DAJ's "minimal disturbance limit" 
remains undefined, but I assume it is what would normally be called a "weak 
limit" (i.e., in the limit, the effect of the measurement on the state being mea- 
sured goes to zero) of their generalization of Wiseman/Milburn's "minimally 
disturbing" measurements!! 

The present Version 4 presents a counterexample to the above claim of DAJ 
assuming that the measurements are "weak" in the sense described in Versions 1 
and 2 (i.e., what Versions 1 and 2 assumed DAJ meant by "minimal disturbance 
limit") and in addition are "minimally disturbing measurements" in the sense 
of Wiseman/Milburn (pTj just described. 

This new counterexample is in Section [12] and should not be confused with 
the counterexample of earlier versions in Section |9l Since the previous coun- 
terexample is still correct as stated, I have not removed it, but its hypotheses 
are different from those of the new example. Sections 2 through 11 are almost 
identical to Version 3; only one significant typo has been detected and corrected. 

Version 5 is mathematically identical to Version 4. Minor improvements in 
the exposition have been made, and a bad typo in equation (|133[) has been 
corrected. 

2 Brief introduction from Version 2 

This work discusses the paper 

"Contextual Values of Observables in Quantum Measurements" 
by J. Dressel, S. Agarwal, and A. N. Jordan, 

which will henceforth be abbreviated DAJ. It introduces ideas which I found 
very interesting. I also found them quite puzzling because it seemed that they 
might be in some way contradictory to the results of my recent work [3] [4] and 
to prior and overlapping work of Jozsa 12 . (DAJ cites all of these works, but 
does not note the possible contradiction.) 

■^The reader is probably wondering wliy I don't simply ask the authors for their definition 
of "minimal disturbance limit" . I have tried repeatedly, but they ignore my inquiries. See the 
Afterword to Version 3 (reprinted below) for more information. 
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Every prudent author knows how easily mistakes are made and devises tests 
for the internal consistency of hi^ work. This note is largely a product of 
a search for a definite contradiction between DAJ and my work. A definite 
contradiction eluded me because DAJ does not state its hypotheses in a mathe- 
matically explicit way. However, for a period of weeks I had the uncomfortable 
feeling that a contradiction might be "right around the corner" , invisible now 
but ready to pop into being as soon as I sorted out the implicit assumptions of 
DAJ. 

I found a number of mathematical errors in the argument of DAJ, but none 
of these seemed crucial. I worried that with appropriate definitions and quali- 
fications, the main thrust of DAJ's argument might survive to produce a con- 
tradiction. 

Finally, I focused on one step in DAJ's argument and decided that not only 
did I not know how to justify this step, but that DAJ's formulation of the step 
seemed so vague that it should be considered a major gap in their proof. It 
would be foolish for me to claim that there is no way to fill this gap, but I 
would be very surprised if it could be filled so as to obtain their conclusions 
under reasonably general hypotheses. 

This work will attempt to present the argument of DAJ in a straightforward 
way which avoids some of their unnecessary and distracting subarguments, with 
the aim of clearly exposing the gaps in it. I leave it to the reader to formulate 
hypotheses under which the gaps can be filled. 

The conclusion of their paper which initially interested me most was their 
equation (7), which states that the "traditional" weak value 

A. = 5^%^ (100) 

is "uniquely defined" by DAJ's theory of "contextual values" to be essentially 
this expression (more precisely, a generalization of this expression to mixed 
states, their equation (7)), in some undefined "minimal disturbance limit" Q 
This reinforces a general impression given by much of the "weak value" literature 
that the traditional weak value (llOOp is somehow inevitable. 

It seemed to me that the correspondence between DAJ's theory of "contex- 
tual values" and traditional "weak measurement" theory was close enough that 
if (|100p were "unique" in the DAJ theory, then it ought to also be unique in 
"weak measurement" theory, but I believed I had proved via rigorous math- 
ematics j3j that the latter was false. I couldn't rest until I had sorted out 
the correspondence between the two theories with the object of resolving the 
potential contradiction. 



^or her, of course. I adhere to the long-standing and sensible grammatical convention that 
in contexts like this, "his" , "hers" , "his or hers" , or "hers or his" all carry the same meaning. 

^Equation (100) is the real part of DAJ's equation (1). Those familiar with the "weak 
value" literature will probably immediately guess the meaning of the undefined symbols, which 
are also undefined in DAJ. ipi is an initial state in which the expectation of an observable A 
is to be measured, and i/^j^ is a "postselected" final state. For more extensive definitions, see 
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Part of this paper explores the correspondence. I regard this as its more 
interesting and fundamental contribution. Along the way, the potential contra- 
diction was resolved. 

3 Acknowledgment 

The following is reprinted from Version 2: 

Acknowledgments are usually placed at the end of a paper, but this one is 
being placed at the beginning because it may affect how readers view various 
statements in it. I sent a draft of the paper to the authors of DA J asking for 
their comments, with the hope of correcting any errors or misunderstandings. 
They sent me several helpful replies, for which I thank them. 

All of the replies were prominently labeled "Private Comments". Because 
of this, I don't feel at liberty to quote them, nor attribute them to the authors 
in paraphrased form. 

The original draft contained a statement that their equation (4) should be 
considered as a hypothesis for their (6), rather than the assertion that the text 
of DA J seems to suggest. They confirmed that this was their intention, and 
specifically authorized me to state that, so I have. As of this writing, they have 
not specifically authorized me to divulge anything else, so I haven't. 

However, they did give me quite a bit of information which mainly confirmed 
what I had conjectured when I sent them the draft. Consequently, this version 
is little changed from the original draft. In particular, I still believe that the 
gaps mentioned above are substantial. 

Everything that I know is incorporated in the present version, though I can- 
not attribute specific statements to the authors. It may be helpful to readers 
to understand that I know more than I can publicly say. 

The following is added in Version 3: 

Since Version 2 was placed in the arXiv, significant new information about 
DAJ's definition of "minimal disturbance limit" has surfaced. This is presented 
in an "Afterword", which the reader may want to scan before proceeding. In 
summary, they are using their own definition of this term which is not fully given 
in DAJ and does not correspond to the usual English meanings of "minimal" 
and "disturbance" . In particular, their concept of "minimal disturbance limit" 
is distinct from the usual concept of "weak limit" . 

To reflect this, the title of Version 2 has been changed and its abstract 
rewritten. The body of Version 2 is still essentially accurate, and has been 
altered only by correction of typos and a few potentially misleading phrases. 

The sentence 

"It may be helpful to readers to understand that I know more than 
I can publicly say" 
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referred to the fact that an attenipted proof that they had sent me for a major 
claim of DAJ (that their (6) imphes (7) in "the minimal disturbance limit") 
was wrong. Up to that point, I had made no promises of confidentiality, so that 
there is no reason that I could not have revealed that, but I felt that I should 
not. Perhaps given time they could correct the proof. 

But the situation has changed. Months have passed, and they have ignored 
direct inquiries asking if they still believe they have a valid proof that (6) implies 
(7) in their "minimal disturbance limit" . I advise anyone who may be thinking 
of investing time in building on the work of DAJ to first seek a proof from the 
authors and judge for themselves. 

4 General introduction and remarks on notation 

I shall attempt to make this paper as self-contained as practical, but to read it 
in detail, the reader should have at least a general familiarity with DAJ. Since 
DAJ contains some mathematical material which seems to me irrelevant and 
distracting, it may save some time to skim the present work first, then read 
DAJ, then return to the present work. 

Also, DAJ, though nominally self-contained, implicitly requires some famil- 
iarity with the ideas of "weak values" of quantum observables. The seminal 
paper introducing "weak values" is [3, but this contains some questionable 
mathematics and is set in an infinite-dimensional context which (in the light 
of subsequent simplifications) is unnecessarily complicated. A leisurely presen- 
tation of my view of the main ideas can be found in [3] and a more compact 
presentation in the style of research papers in [?] . 

DAJ employs a common physics notation which seems to me overly compli- 
cated, hard to typeset, and not especially well-suited to the presentation of fun- 
damental ideas. However, since the present work is largely an analysis of DAJ, 
I hesitate to entirely abandon their notation. After some thought, I decided to 
use my preferred notation (the notation of and U) for the preliminary and 
subsequent discussions, but to switch to theirs for the detailed analysis of their 
paper. Equation numbers (1) through (9) will be reserved for equations in DAJ 
(which have the same number there) ; new equations in the present work will be 
given numbers (100) and above. 

The next section reprints the "Notation" section of [3j. A following "Pre- 
liminary Summary" section briefly sketches my view of traditional "weak mea- 
surement" theory as it relates to DAJ. The analysis of DAJ starts in Section [71 
and readers already familiar with DAJ may prefer to start there. 

Finally, perhaps an apology concerning typography is in order. I tried to 
use the notation of DAJ in analyzing it, but had some problems duplicating 
their symbols exactly using laTeX. They use boldface for operators, but when I 
specified boldface in laTeX "math mode" , the symbols came out in the "text" 
font rather than "math font"; e.g., E instead of E. I wrote the paper that way, 
and only later found out how to exactly reproduce most of their symbols. I have 
not attempted to correct this because the existing symbols seem close enough 
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to theirs and changes sometimes have unanticipated effects. Anyone who has 
worked with TeX/laTeX should understand. 

5 "Introduction and notation" reprinted from 

m 

We assume that the reader is familiar with the concept of "weak value" of a 
quantum observable. This concept was introduced in the seminal paper [7] of 
Aharonov, Albert, and Vaidman, called "AAV" below. It will be briefly reviewed 
below, and a much fuller presentation intended for those unfamiliar with weak 
values can be found in [3] . 

We attempt to stay as close as possible to traditional physics notation, re- 
verting to notation more common in mathematics only when it seems less am- 
biguous or complicated. Our mathematical formulation of quantum mechanics 
generally follows that of Chapter 2 of the book of Nielsen and Chuang 5 , with 
differences in notation noted below. 

The inner product of vectors v,w in a complex Hilbert space H will be 
denoted (v^w), with the physics convention that this be linear in the second 
variable w, and conjugate-linear in the first variable v. The norm of a vector v 
will be denoted as \v\ := {v,v)^/'^. 

Technically, a (pure) "state" of a quantum system with Hilbert space H 
is an equivalence class of nonzero vectors in H , where vectors v,w € H are 
equivalent if and only if w — av for some nonzero constant a. However, we 
informally refer to vectors in H as "states" , or "pure states" , when we need to 
distinguish between pure states and "mixed states" (see below). A state v is 
said to be normalized if \v\ = 1. We do not assume that states are necessarily 
normalized. 

The projector to a subspace E will be denoted Pe, in place of the common 
but unnecessarily complicated physics notation where {e^} is an 

orthonormal basis for E. When E is the entire Hilbert space of states, Pe 
is called the identity operator and denoted / := Pe- When E is the one- 
dimensional subspace spanned by a vector w, we may write P^ for Pe- When 

= 1, P-uoV = {w,v)'w, but the reader should keep in mind that under our 
convention, = Pw/\w\j so this formula for P^ only applies for \w\ = 1. 

Mixed states are represented by "density matrices" p : H ^ H, which are 
defined as positive operators on H of trace 1. A pure state h £ H corresponds 
to the density matrix Ph- 

We shall be dealing with a quantum system S in which we are primarily 
interested, which will be coupled to a quantum "meter system" M . We make no 
notational distinction between the physical systems S and M and their Hilbert 
spaces. 

The composite system of S together with M is mathematically represented 
by the Hilbert space tensor product S ® M . We assume the reader is generally 
familiar with the mathematical definition oi S ® M . The highlights of the 
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definition are as follows. 

Some, but not all, vectors in S ^ M can be written in the form s (gi m with 
s G S and m e M; these are called "product states" . Typical physics notation 
for s ® TO might be |s)|to) or \s)s\m)M- Every vector v m S ^ M is a. (possibly 
infinite) linear combination of product states : v = ■Sj <8) rnj. 

If p is a density matrix on S<^M, its partial trace with respect to M, denoted 
TrM p : 5 ^ 5 is a density matrix on S. The (mixed) state of S corresponding 
to p is Tim p- 

6 Preliminary summary and intended audience 
for this work 

Prospective readers will naturally want to know what will be the payoff for their 
valuable time invested in understanding this paper. This section attempts to 
address this issue in a more complete way than an abstract. 

The abstract of DAJ states: 

"We introduce contextual values as a generalization of the eigen- 
values of an observable that takes into account both the system 
observable and a general measurement proeediire. This technique 
leads to a natural definition of a general conditioned average that 
converges uniquely to the quantum weak value in the minimal dis- 
turbance limit. As such, we address the controversy in the literature 
regarding the theoretical consistency of the quantum weak value by 
providing a more general theoretical framework and giving several 
examples of how that framework relates to existing experimental and 
theoretical results." 

I was intrigiied by this abstract and assume others may be also. Such potential 
readers, should be aware that questionable mathematics renders some conclu- 
sions of DAJ unproved, and some are arguably false. In particular, the present 
work presents a coTinterexample to their claim that their "general conditioned 
average" "converges uniquely to the quantum weak value in the minimal dis- 
turbance limit" (using a definition that seems appropriate for their otherwise 
undefined "minimal disturbance limit"). 

However, I still find their ideas intriguing, and I wonder if a reformulation 
might yet prove useful. Those who enjoy playing with ideas may find some 
interest both in DAJ and in the present work. 

Following is a thumbnail sketch of my interpretation of the arguments of 
DAJ and my conclusions about them. They will probably not be meaningful to 
someone unfamiliar with weak values. It is hoped that they may give the reader 
a sense of what is to come, should he decide to invest the time in a careful 
reading. 

First we review some key ideas of weak measurement, suppressing technical 
details. We are given the Hilbert space S for a quantum system of interest 
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(also denoted S), an arbitrary number of copies of some pure state s of S, 
and an observable (Hermitian operator) A on S. Our object is to measure the 
expectation (s, As) of A in the state s, denoted (s, As) while negligibly changing 
the copies of s. 

In the "weak measurement" literature, this is usually accomplished by cou- 
pling S* to a "meter system" M, obtaining a composite system mathematically 
described as the tensor product S (S) M oi S with M, and measuring a new ob- 
servable i? on M (actually, measuring I ^ B on S ^ M) in a slightly entangled 
normalized (pure) state e = e(s, e) of S' €3 M, depending on s and a small real 
parameter eO Here "slightly entangled" means that lim^-^Q Ti m Pe(s,e) — Ps, 
where Py denotes the projector to the one-dimensional subspace spanned by the 
vector V. 

It is possible to choose B and e(s, e) so that 

lim TrA, = Ps (101) 

and 

(s. As) = lim(e(s, e), (/ (B/e)e(s, e)) . (102) 

Measuring B or B/eis essentially the same as measuring the spectral projector- 
valued measure {Qj}"^i associated with B^ The result of the measurement of 
the spectral projectors is a new (unnormalized) state {I ®Qj)e with probability 
\{I ®Qj)e\'^. The (generally mixed ) state of S is then obtained by tracing over 
M: 

(unnormalized) state of 5 = Tr^/ [-P(/®Qj)e] 

In practice, the map s H> e(s, e) is usually obtained by applying an e- 
dependent isometr}^ [/(e) : S S ® M io s: e(s, e) := U{t)s. Assuming 
this, for fixed e, the map Ps i— S- Pe(s.c) is easily seen to extend to a completely 
positive map from (mixed) states of S to states oi S ® M . Since tracing is also 
completely positive, the composite map 

Ps Pe{s,t) ^ TrM -P(/®Qj)e(s,e) 

extends to a completely positive map on states of S. Then Stinespring's [8] or 
Choi's [2 theorem implies that, for each j there exist operators Mj^2, ■ • ■ Mj,kj 
on 5, J = 1, . . . , m, such that for all s G S, 

TTM[PmQ,Pe(s.)]=i2M,.PsM]^.^ . (103) 

1=1 

^DAJ calls this parameter g, a fact which I had not noticed while writing the above. I have 
decided not to change the present e to g because that risks introducing errors in case some 
substitutions are overlooked. 

^Here we are taking some liberties for ease of exposition: Strictly speaking, we are mea- 
suring I ® B and {/ ® Qjjrather than B and {Qj}, and we are implicitly assuming a finite- 
dimensional context so that the spectral measure of B is finite. 

^An isometry V from a Hilbert space H to a, Hilbert space K is an operator which preserves 
the inner product: {Vhi,Vh2) = {hi,h2) for all h\,h2 G H. The difference between an 
isometry and a unitary operator is that the isometry is not assumed to map H onto K, i.e., 
VH need not be all of K. 
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For simplicity of notation, I prefer to leave states unnormalized unless the 
context requires normalization. After normalization, the operators Mj^i will 
additionally satisfy 

m 

j2J2K^Mj-^^^ , (104) 

3=1 *=1 

where / denotes the identity operator. 

It is common in the physics literature (though mathematically incorrect) 
to pretend that the sum in equation (|103p only has one term, in which case 
one may rename the Mj^i as Mj and (assuming (|104p ) call them "measurement 
operators" H DAJ assumes this simplification without mention, and for nota- 
tional simplicity, we shall also. No essential change in most (but not all) of the 
arguments of DAJ would be necessary if the more general (|103p were usedlf] 

Assuming this simpification, with the projector- valued measure {Qj} for B 
is associated the positive-operator-valued measure (henceforth called a POVM) 
{mJMj}. DAJ renames this POVM as {Ej}, with Ej := M^Mj. 

Then when 5 is in a pure state p ~ Pg, the probability pj that the j'th 
measurement outcome occurs is 

= Tr [(/ Qj)Pei,^e)] = Trs Ttm [{I ^ Qj)Peis,e)] ~ Tr^ [E,P,] , 

with the approximation becoming exact as e — 0. The corresponding equation 
for 5 in a mixed state p in the limit e is 

Pj = Tr [Ejp] . 

DAJ emphasizes that what is mainly experimentally accessible are the prob- 
abilities Pj, and these may be obtained either from the projector- valued measure 
associated with the Qj or the POVM {Ej}. The numerical values aj which are 
averaged with the pj to obtain the expectation oi B/e are less fundamental. 

To put the matter more picturesquely, imagine that measuring B/e entails 
recording the position of a meter pointer on a scale (with only finitely many 
pointer positions possible due to our assumption of finite dimensionality). We 
can change the scale on the meter if we want, effectively obtaining a new meter. 
If the original meter did not always average to the expectation Tr [Ap] of A in 
the state p, we could ask if it were possible to change the scale so that it did 

®This may have something to do with the fact that the influential and generally excellent 
2000 book [5] of Nielsen and Chuang does this without mention, referring to the (renamed) Mj 
as "measurement operators". Strangely, a 1997 paper coauthored by Nielsen ("Quantum 
measurements" section, equation (3.1)) clearly indicates that he was perfectly aware that 
1I103II is the correct formulation. 

^DAJ mainly uses only the positive-operator- valued measure (POVM) Ej^i := ^Mj^i 
associated with the which for many of their purposes can be replaced by a new singly- 

index POVM Ej := ^^Mt^Mj^^. For issues regarding "weak" measurement (which they do 
not treat in a precise way), it would probably be necessary to use the more general doubly- 
indexed form. 
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average to Tr [Ap] for all states p. The new scale values (if they exist) are called 
"contextual values" by DA J. 

Since reading the meter in M is the same as assigning real values to the 
outcomes j of the POVM {Ej} in S we could dispense with the M meter and 
work entirely in S, in terms of a given measurement operators {Mj}. DAJ 
explores the possibilities of such an approach. 

Note that in this formulation, it would not be necessary to assume that the 
measurement is "weak" (i.e., that it did not significantly alter the state of S), 
and through their equation (6), DAJ does not make this assumption. However, 
outside the context of weak measurement, it seems unclear what would be the 
advantage of using general measurement operators in place of normal "strong" 
measurements obtained from the spectral measure of A, and DAJ does not 
discuss this question. 

Eventually, DAJ adds a "weakness" assumption to make contact with tra- 
ditional "weak measurement" theory. Unfortunately, their particular notion of 
"weakness" (which they call "minimal disturbance") is never clearly defined in 
their paper, forcing the reader to guess at their definitions. We shall point out 
that under some reasonable guesses, their conclusions would be false. 

Some believe the basic idea of traditional "weak measurement" theory to be 
of questionable utility (cf. the "Remarks" section of [4j). However, those who 
do think it potentially useful may find attractive DAJ's idea of doing it entirely 
within the original system S of interest, without reference to an external "meter 
system" . Those interested in extending the work of DAJ may be interested in 
learning of its gaps from the present work. 

7 The main argument of DAJ 

We shall use the notation of DAJ and reproduce their equations as written there. 
We also use their equation numbers, which range from (1) to (9). Thus this 
section will probably only be meaningful to readers who have some familiarity 
with DAJ and have it at hand. In addition, we assume the reader has at least 
skimmed the "Preliminary summary" section above. 

DAJ attempts to implement weak measurements via "measurement opera- 
tors" A4 = {Mj }, seemingly using the definitions of the book [5] of Nielsen and 
Chuang (though this reference is not specifically cited). It is known ([5], pp. 
94-95) that such measurement operators can always be obtained by coupling 
the system S of interest to an ancillary "meter system" M obtaining a new sys- 
tem S (8> Af), performing a projective measurement in M, and then tracing 
out M to obtain measurement operators on S, as described in the "Preliminary 
summary" section. Thus I would not expect to obtain anything essentially new 
from the formalism of DAJ, though it might turn out to be more insightful or 
convenient than traditional "weak measurement" approaches. 

The discussion of DAJ on their page 2 attempts to obtain the expectation 
{A) in a given state p of an observable A (corresponding to operator A) from 
the above measurement operators and the associated positive-operator-valued 
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M^M,, which leads to their equation 



(2) 



3 3 



for real numbers ai,a2, ■ ■ ■ ,olm called contextual values and abbreviated CV. 
(The Pi in the middle term represent probabilities and should be Pj.) 

Denoting the eigenvalues of the operator A by 01,02, .. . ,Om, and the cor- 
responding spectral projectors by Ilj, equation (2) leads to their equation (4): 



which in turn leads to a system of m equations in N unknowns. I initially 
interpreted the subsequent discusssion as claiming that (4) always has a solution 
(obtained from the Moore- Penrose pseudoinverse) when N > m, and I still think 
the paper does indeed give this impression. However, the authors have informed 
me (and authorized me to say) that instead, (4) was intended as a hypothesis for 
the subsequent discussion, and that the sentence just below (4) was intended to 



They then propose (without giving any substantial reason) that "the physi- 
cally sensible choice of CV [contextual values] is the least redundant set uniquely 
related to the eigenvalues through the Moore- Penrose pseudoinverse" . ( "Least 
redundant set" is not defined.) A long paragraph then presents the Moore- 
Penrose pseudoinverse in a complicated way which obscures its essential sim- 



// there is a solution a = (ai, . . . ,a7v) to (4) (as they assume), then indeed 
(Jo := F+o is also a solution, where F+ denotes the Moore-Penrose pseudoin- 
verse. But it is obscure to me why F+a should be considered as better or more 
fundamental than any other solution. It seems to me that the whole discussion 
of the Moore-Penrose pseudoinverse is distracting and essentially irrelevant to 
the rest of the paper. 

Before continuing with the arguments of DA J, let us put into perspective 
some ideas which have been presented previously, both in this and the "Pre- 
liminary summary" sections. DAJ assumes that (4) always has a solution a. 
(Whether it is given by the Moore-Penrose pseudoinverse is irrelevant.) "Weak 

^"A simple example for which (4) has no solution is Ej := I /N with A not a multiple of 
the identity / (and using a trace normalized to Tr / = 1). 

^^In fairness to the authors, it should be noted that such complicated presentations are com- 
mon in the physics literature, so they are hardly alone. Their presentation uses the singular 
value decomposition which is presented in a similarly complicated way in the influential and 
mostly excellent book 5 of Nielsen and Chuang. And the presentation of the Moore-Penrose 
pseudoinverse in Wikipedia as of this writing (Jan. 23, 2011) is similarly and unnecessarily 
complicated. 

An appendix discusses the Moore-Penrose pseudoinverse in the simple way that most mathe- 
maticians probably view it. I took the time to write it in the hope that it may be helpful to 
those unfamiliar with this and similar concepts. 




(4) 




plicityl 
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measurement" theory gives something analogous for approximate sohitions to 

More precisely, weak measurement theory gives measurement operators Mj = 
M.j{g) and contextual values aj — aj{g) which depend on a small "weak mea- 
surement" parameter g and which approximately satisfy (4), the approximation 
becoming exact in the limit g ^ 0. In addition, the disturbance of a given state 
p by the measurement goes to as g — 0. 

Therefore, in anticipation of applications of contextual value theory to weak 
measurement theory, we ought to have in mind measurement operators 'M.j{g) 
and contextual values otj{g) which depend on a small parameter g. DAJ does 
introduce this generalization later. 

Before continuing the general discussion, it will be enlightening to touch 
on the paper's example entitled "Photon polarization" which interprets an ex- 
periment of Pryde, et ai, jTO] in the context of CV's. Here the measurement 
operators and associated POVM can be obtained in the exactly the way de- 
scribed in the "Preliminary summary" section, though DAJ does not indicate 
how they were obtained. And, the CV's and POVM do depend on g. There are 
just two CV's denoted a+{g),a-{g)^ and it turns out that a±{g) = il/g. This 
example teaches us that in general, we can expect limg^Q aj{g) — oo, and we 
have to expect that arguments which would work for bounded aj{g) may fail. 

Now we continue with the paper's argument. It renames the previous mea- 
surement operators, M — {Mj} as M.^^^ = {M^^''} and considers a second set 

of "postselection" measurement operators which it denotes MS^^ = {M^^''}. 

Though it does not say so, this second set of measurement operators is 
presumably {P/, (/ — -P/)}, where P/ is the projector on a postselected final 
state / G 5o Anyway, this is the case of interest in applying DAJ's theory of 
contextual values to traditional weak measurement theory, and for simplicity I 
shall assume this case below. 

It considers first making a measurement described by MS^^ ^ then a sub- 
sequent measurement described by MS"^^ . The composite measurement is de- 
scribed by measurement operators which DAJ denotes A^^^'^^ — {M^^''My^^}. 

The associated POVM is {U^f^ = M^-^^^M^^^^M^^^M^^^} Before continuing, 
note that in the traditional context of weak measurements, these composite 
measurement operators correspond to making an initial measurement in the 
meter system M and then postselecting in S. 

DAJ then states what it characterizes as its "main result" , its equation (6) 
giving the "conditioned [on successful postselection] average of [A]": 

i E,Tr[Ef/)p] 



Actually, DAJ appears to use / as an index rather than as a postselection vector, but 
our change of notation seems likely to cause less confusion than renaming the postselection 
vector something like h, in which case DAJ's ' would equal for some choice of index /. 
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In the context of traditional weak measurement theory, equation (6) gives the 
average value of the meter measurement (whether "weak" or not) conditional 
on successful postselection. Since the measurement operators depend on what 
kind of meter measurement is performed, it may be potentially misleading to 
characterize (6) as a conditional average of A. Given the measurement opera- 
tors, the right side of (6) surely is the "conditioned average" of meaurements 
derived from these operators, but different measurement operators satisfying 
(4) may conceivably give different conditional averages. DAJ does make this 
clear in the surrounding text, but it seems to me that denoting any of these 
conditional averages by a symbol like f{A) which involves only A and / and 
not the measurement operators invites reader misinterpretation. For instance, 
in the "Photon polarization" example of Pryde, et al. [10] mentioned above, the 
measurement operators arose from the particular meter measurement chosen, 
but had Pryde, et al., chosen a different meter measurement, a different weak 
value ("conditioned average") might have been obtained. 

DAJ claims that (6) implies (7), under a hypothesis that "the state [p] is 
minimally disturbed" which they never precisely define: 

Tr[Ef{A,p}] 
2Tr[Efp] ' ^' 

where {■, ■} denotes the anticommutator, i.e., {B,C} :— BC + CB. A little 
algebraic manipulation shows that this is a generalization to mixed states p of 
the traditional weak value (jlOOl) for pure states. 

I cannot follow DAJ's argument leading from (6) to (7), and have been unable 
to guess a precise meaning for the "minimally disturbed" hypothesis which could 
enable their argument. This is the possibly fundamental gap in their argument 
previously mentioned. A later section will present a counterexample showing 
that (6) does not imply (7) for a generalization of their "Photon polarization" 
example, using what seems to me a reasonable (and nearly unique reasonable) 
definition of "minimal disturbance" . 

Recall the earlier remark that in a "weak measurement" setting, the measure- 
ment operators Mj = Mj{g) and contextual values aj = OLj{g) must implicitly 
depend on a small real parameter g corresponding to the "weakness" of the mea- 
surement. Up to equation (6), this parameter has been unrecognized in DAJ 
(because their setup does not require that measurements be "weak"). However 
the paragraph of DAJ following (6), labeled "Weak values", does explicitly in- 
troduce the parameter g and sketches an argument intended to show that (6) 
implies (7) under an additional "minimal disturbance" hypothesis which they 
never precisely formulate. The following intends to summarize and analyze their 
argument which I cannot follow in detail. 

They write the original measurement operators as M.^j^\g) — 'M.j{g) = 

~ 1/2 - . . 

\Jj{g)Ej (g), where Uj(g) is unitary. This polar decomposition is surely pos- 
sible. Then they claim that Stone's theorem can be applied to write iJj{g) = 
expiigGj) for Hermitian operators Gj. But Stone's theorem only applies to uni- 
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tary groups, i.e., it assumes that tJj{gi + 32) — j {91)^ 0(92), which is surely 
not true in the general context of DAJ. For example, if it happened to be true 
for some particular measurement operators, it could be made false for others by 
nonlinear ly rescaling the parameter g. 

Even if it were true, I don't understand its relevance to the problem at 
hand. They go on to state that ". . . if [Gj,p] = 0, so the state is minimally 
disturbed, then . . . the generalized WV [their ref. [13]] is uniquely defined as 

2Tr [Ef^p] 

where {•, •} denotes the anticommutator." 

But shouldn't "minimal disturbance" require that p (nearly) commute with 
the Ej as well as with the \Jj , or equivalently, with the measurement operators? 
If so, it is hard to understand the utility of introducing the polar decomposition. 
Even supposing that p can be assumed to (nearly) commute with the Uj, do 
the authors additionally assume its (near) commutativity with the Ej? In that 
case, why not equivalently assume (near) commutativity with the measurement 
operators to start? And even if one does have some sort of near commutativity 
with the measurement operators, since the contextual values aj^^ (g) can become 
unbounded for small g, some additional argument would seem necessary to 
obtain DAJ's conclusion that (6) converges to (7). 

Finally, let us expand on the "minimal disturbance" assumption. The most 
plausible way to make this precise seems to me the following. 

If the j-th measurement outcome is obtained, the subsequent state is 

M,-(.9)pM,-(.9)t 
Tr [M,(g)/5M,(5)t] ' 

assuming that the denominator does not vanish. So, I would define "minimal 
disturbance" to a state p as 

lira ^^ME^^I^ = p for all,, (105) 



9-0 Tr [M,{g)pM,{g) 



assuming that the denominator does not vanish identically in a neighborhood 
of g = OEI 

A later section gives a counterexample showing that (6) does not imply (7) 
under this definition of "minimal disturbance" . I know of no other plausible 
definition under which DAJ's argument that (6) implies (7) would make sense. 



^•^ If the denominator does vanish for some (but not all) g near 0, the limit is to be understood 
as taken over all g for which the denominator does not vanish. 

If the denominator vanishes for all g near 0, then we leave "minimal disturbance to a state 
p" undefined. Physically, this would correspond to zero probability for result j for small g, so 
that for all practical purposes, for that state p, the result j could simply be deleted from the 
list of possible measurement results. 



15 



8 Hermitian measurement operators 

This section examines the special case of Hermitian measurement operators Mj 
under our "minimal disturbance" assumption (jl05l) . For example, this is the 
case for DAJ's "Photon polarization" example. The counterexample of the next 
section will require the result of this example, and it is enlightening and not 
much more trouble to work out part of it in a more general context. 

To avoid distracting degenerate cases (cf. the footnote to the definition 
of "minimal disturbance" of the last section), it will be assumed below that 
the denominator of our proposed "minimal disturbance" condition (|105|) never 
vanishes. For the application of the results of this section to the counterexample 
of the next section, no degenerate special cases occur, and this assumption is 
unnecessary. 

Recall that for simplicity we are assuming that the final measurement is 
postselection to a final state f G S, so that the only final measurement oper- 
ator of interest is Pf. This makes the superscripts (1) and (2) on the various 
quantities in (6) superfluous and allows us to write (6) in the simpler-appearing 
form: 

{A)^^^ ^ . / , (106) 

E,Tr[P/M,pMt] 

where we have used Pj = Pf and the cyclic property of the trace to suppress a 
Pf on the right. 

Including the "weak limit" parameter g, and using the "minimal distur- 
bance" assumption (|105p . the denominator of (|106l) can be written in the limit 
g — as 

limVTr[P/M,(5)pM](g)] = 

j 

lim y Tr [Pf ^ /5 Tr [M,(g)/}M (5)] 



hm VTr [Pfp\TT [M, (g)pM](g)] 



= Tr [P/p]hmTr yM,(.9)tM,(g)/) 
j 

= Tr[P/p] , (107) 

because J^^jigY'^jig) — I ^-nd Tr p = 1. This is half the denominator of 
DAJ's final result (7). Note that this simplification of the denominator did not 
assume Hermiticity for the measurement operators; it will be needed later for 
non-Hermitian measurement operators. 

Next note that part of the numerator of (6) can be written as: 

M,pM] = (1/2) ( M,M]p + pM,M] ) + (1/2) ( [M, , p]M] + [p, M]] ) , 

(108) 
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where the brackets denote commutators: [B, C] := BC — CB. The full nu- 
merator of (6) or (jl06|) similarly decomposes into two terms corresponding to 
(UMl): 

numerator of (6) — 

(l/2)Tr [Pf J2 «.{M,m1/5 + pM,M]}] 

3 

+(l/2)Tr \Ps J2 ( [M, , p] M] + M, [p, M]] ) ] . (109) 
j 

Recalling that the contextual values aj are assumed chosen so that J2j Q^j^^Mj = 
A and using the assumption of Hermiticity to write MjMj = M^Mj, the first 
term of (|109p simplifies to 

(l/2)Tr[PHA,p}] , (110) 

which is half the numerator of DAJ's (7). Thus we obtain DAJ's (7) under the 
assumptions of Hermitian measurement operators and "minimal disturbance" 
(jlOSp if and only if the second term in (|109p vanishes in the limit 5 — J> 0. 

But ignoring that term seems problematic. The problem is that the contex- 
tual values aj = aj{g) may become large as 5 — J> 0, as occurs for DAJ's "Photon 
polarization" example for which aj{g) — ±l/g. So even if the "minimal distur- 
bance" assumption could somehow be applied to assure that the commutators 
in the second term of ()109p become small as g — > 0, we would still need some 
additional control over the rates of convergence to assure the vanishing of the 
second term in the limit 5^-0. The lack of control over the rates of convergence 
seems the crucial difficulty in passing from (6) to (7), even for the special case 
of Hermitian measurement operators. 

9 Contextual weak values are not necessarily 
the traditional weak value. 

This section presents a counterexample showing that the difficulty in passing 
from DAJ's equation (6) to the traditional weak value (|100[) is essential if we 
accept the definition (llOSp of "minimal disturbance". It is a modification of 
DAJ's "Photon polarization" example on their page 3, but I shall use notation 
easier to typeset o 

This example arises from an experiment of Pryde, et al. |10, . which per- 
forms a weak measurement and obtains the traditional weak value (|100|) for 
that measurement. The measurement operators which DA J denotes M± and 
the contextual values (which are the eigenvalues of the "meter observable" of 

^''Like many physics papers, DAJ distinguishes operators by both boldface and "hats". 
Notationally distinguishing different quantities may be helpful to readers who are skimming a 
paper, but surely either boldface or hats would be enough. For serious readers, neither should 
be necessary, and I will omit them. 
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[TU] ) can be obtained from data given in the paper of Pryde, et al., as described 
in the "Preliminary summary" section around equation (I103p . For this pur- 
pose, one starts with the shghtly entangled state called there e(s, e) as given in 
[10] but not mentioned in DAJ. For the purpose of merely checking that DAJ's 
(6) does imply (7), one can start with the DAJ's measurement operators and 
contextual values without considering their origin, and this is what we shall do. 

DAJ's "Photon polarization" example is set in a two-dimensional complex 
Hilbert space S with a distinguished orthonormal basis with respect to which 
all matrices below will be written. As in DAJ, the operator A for which (s. As) 
is to be weakly measured will have matrix with respect to this basis 



A^ 



(111) 



Let g denote a small parameter let A := + .9)/2, fJ, := ■\/(l — g)/2, so 

A2+^2 = l[ll 

Let M± be as given in DAJ, namely 



M4 



A 
fi 





A 



The contextual values a± — a±{g) given in DAJ are 

a±{g) = ±1/5 , 

and these result in 

A = a+MlM+ + a_i\flM_ = a+All + a^Ul for all g. 
We shall define two new measurement operators depending on g: 
M,{g):^U{g)M+{g) and 1/2(5) := M_(.g) , 



(112) 

(113) 
(114) 
(115) 



with U{g) unitary operators to be specified later. Note that this leads to the 
same POVM {^i M^Mi, E2 := M^Ah} as in DAJ, so that J^j o^j^j = A 
holds in both contexts if we define 

ai{g):=a+{g)^l/g, 02(5) := a_ (5) = -1/g . 

The starting point for the counterexample will be the fact that when U{g) = I 
for all 5, equation (6) of DAJ does lead to the traditional weak value given in 
its equation (7). 

To check this, verify that for an arbitrary mixed state 



P = 



Pii P12 
P21 P22 



(DAJ denotes A as 7 and fi as 7, but I hesitate to do this because the mathematical liter- 
ature generally uses an overbar to denote complex conjugate. Also, I accidentally substituted 
A for 7, and don't want to reset the type. 
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(m - A)p2i 



(A - n)pi2 




and 



[M+,p]M+ + M+[p,M+] 



(A-m)')P2i 







(A - A*) Vi2 




Interchange A and /i to obtain the corresponding expression with M+ replaced 
by M_, and the corresponding expression is the same. Then the expression 



in the second term of (I109p is easily seen to vanish identically because [M+, p]M]^ + 
M+[p, Ml] = [Af_, p]m1 + M_ [p, Ml] but a_ = — q:_|-. Hence the second term 
of (|109p vanishes identically, and then (7) follows. 

Now consider the effect of replacing the Hermitian measurement operators 
M+{g) with new measurement operators Mi{g), M2{g) with 



with U{g) a g-dependent unitary operator to be determined. We shall choose 
U{g) so that limg_j>o U{g) is the identity operator / in order to satisfy our "min- 
imal disturbance condition (|105l) . 

Recall that equation (|106|) gives the expectation / (A) of A in initial mixed 
state p and postselected to the pure state f E S. Let A{g) denote the differ- 
ence between the value given by this equation for the Hermitian measurement 
operators M^{g), M^{g) and for the new measurement operators Mi(g) := 
U{g)AI^{g), M2 M^{g). To show that (6) does not imply (7) in general, 
we shall show that it is possible to choose unitary U{g) and states p, /, such 
that 



Equation (I107|) shows that the limit as g — > of the denominator of (I106P 
is Tr [Pfp] for both sets of measurement operators. The numerator of (|106p for 
measurement operators Mi , M2 is 

aiTr [PfUM+pM+U^] + QaTr [P/M_pAf_] = 

a+Tr [ [P/, t/]M+pM+C/t +a+Tr [UPfM+pM+U^] + a_Tr [P/M_pM_] 
= a+Tr [ [P/, U]M+pM+U^ ] + (a+Tr [P/M+pM+] + a_Tr [P/M_pAf_]). 

The term in parentheses is the numerator of (jlOSp for measurement operators 
M+,M_. 

We shall choose U (g) :~ exp{igH) with H a Hermitian operator which is not 
a multiple of the identity. Then the difference of the limits of the numerators 



Y,c^A[Mj,p]Mj + M,[p,M]]) 



j 



M2{g) := M_(5) and A/i(g) := U{g)M+{g), 



limA(5)^0 

3^0 



(116) 
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of (|106p for the two sets of measurement operators is 

lim Tr [a+(g)Tr [P^, U{g)]M+{g)pM+{g)U\g) ] - (117) 

limTr [{l/g)Tr[[Pf,U{g)]M+{g)pM+{g)U\g)] 

= IV [[Py,*iJ]p] , (118) 

where the limit of the commutator is given by: 

\\M\/g)[Pf,U{g)] = hm[P/,(C/(,g)-J)/.9] 

= [P/,d;7(5)/d.g|3=o] 

= [P/,*i/] . 

We have shown that 

g^o Ir [P/pJ 

It is well-known and elementary to show that the only operators H which can 
commute with all one-dimensional projectors P/ are multiples of the identity 
operator, so there exists some f E S such that [Pf,iH] ^ 0. And then there 
exists some mixed state p, which can be chosen of the form p — Ph, such that 
Tr [[Pf,iH\Ph\ = (/i, [Pf,H]h)/{2\h\^) ^ 00 The last fact follows because it 
is also well-known that the only operator B which can satisfy (/i, Bh) for all 
/i is B = 0; for present purposes we only need this for the Hermitian operator 
B = [Pf, iH] for which it is obvious from the spectral theorem. 



10 Summary and conclusions 

"Measurement operators" on a quantum system S can always be implemented 
by passing to a tensor product S (Si M of S with a ficticious ancillary quantum 
system M, performing a projective measurement in 5 (g) M with orthogonal 
projectors of the form {I'SiQj}, and then tracing out M to obtain measurement 
operators on 5 ([5], pp. 94-95). By "passing to ® M" we mean identifying S 
with a subspace of S' (g) M via an isometry V : S ^ S Si M. 

This gives a way to translate the language of "measurement operators" on 
S into the traditional language of "weak measurement" theory. The projective 
measurement in S" €5 M corresponds to a "meter measurement" in M, where 
{I<S)Qj} are the spectral projectors for a "meter observable" I(E)B. (We speak 
of "weak measurement theory" because we need some name for it, but the trans- 
lation just mentioned does not require "weakness" of measurements deriving 
from measurement operators, and we do not necessarily assume it below.) 

For the inverse translation, one needs to replace the language of "measure- 
ment operators" as expounded in [5] by something more general (cf. equation 



The factor 1/2 arises because we are using a trace normalized to Tr 7 = 1. 
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l|103p ). Thus the language of weak measurement seems more general than the 
language of measurement operators 

DAJ attempts to do weak measurement theory entirely within S, without 
introducing a "meter system" M, replacing it by assuming as given a set of 
measurement operators. DAJ's equation (6) formulates a definition of "condi- 
tioned average" of an observable A on S' in terms of measurement operators, 
which it denotes by notation similar to f{A). Assuming DAJ's equation (4), the 
unconditioned average obtained from these measurement operators is the same 
as the average of A that would be obtained in the usual way from the spec- 
tral projector- valued measure for A, but even assuming (4), the corresponding 
"conditioned averages" need not be the same (as DAJ clearly recognizes). The 
analog of DAJ's (6) in weak measurement theory would be the expectation of 
the meter measurement conditional on successful postselection in S. 

Much of the traditional weak measurement literature blurs the distinction 
between the latter and what one might call the "conditional expectation of A 
given successful postselection" . I think it worth emphasizing that DAJ does not 
make this mistake. Though it does use the potentially misleading notation f{A) 
which suppresses the measurement operators from the "conditioned average" 
obtained from them, it explicitly points out that f{A) does depend on the 
measurement operators, and in general cannot be obtained from A (and the 
initial and final states of S) alone. 

Next DAJ claims that in some "minimal disturbance" limit which it does 
not precisely define, the dependence of its "conditioned average" f{A) on the 
measurement operators washes out, and the limit is the traditional "weak value" 
(7) which depends only on A and the initial and final states in S. 

However, I think that its conclusion that f{A) necessarily converges to the 
traditional weak value (|100l) in some "minimal disturbance limit" is still un- 
proved and probably false under definitions of "minimal disturbance limit" 
which refiect the normal meanings of the English words. The present work 
proposes a definition of "minimal disturbance limit" which seems physically 
compelling and under which f{A) can converge to something other than the 
traditional weak value (|100p in this limit. 

11 Afterword from Version 3 

Since writing the above, Tve discovered some misunderstandings which I need 
to correct. Nothing is wrong in the preceding, but I know more now than when 
I wrote it. Revising the entire manuscript would not be worth the effort, so 
instead I am adding this Afterword. 

I acquired this additional insight in the following way. I urged the authors 

DAJ's claim that "the WV [weak value] can be subsumed as a special case in the CV 
[contextual value] formulation" seems to me overreaching. It seems difficult to pinpoint a 
precise logical relation between "contextual value" and "weak value" theory due to lack of 
systematic exposition of both in the literature. However, intuitively, I regard weak value 
theory as more general. 
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of DAJ to submit an Erratum to Physics Review Letters (PRL) correcting 
the many errors and misleading obscurities in the original. I was particularly 
concerned that an attempted proof they had sent me that (6) implies (7) for 
positive measurement operators was incorrect. After they made clear that they 
had no intention of doing so, I submitted a "Comment" paper to PRL. 

As is the usual policy of PRL, they first sent the "Comment" to the au- 
thors, who returned a lengthy reply. It was much clearer than the two previous 
communications which they had sent me, and in its light, I saw that I had not 
properly understood their usage of the term "minimal disturbance limit" . I still 
don't fully understand the sense in which they use this term, but I have a better 
idea than before. I cannot improve my understanding because further inquiries 
to the authors have gone unanswered. 

Before continuing, I want to make clear that nothing I say from here on 
is authorized by the authors of DAJ. It reflects nothing more than my best 
understanding based on what they have sent me and PRL. Most of it comes 
from their reply to PRL. A draft has been sent to the authors of DAJ. They 
have not replied. 

Their response to PRL claims that the term "minimal disturbance mea- 
surement" has a precise meaning in quantum measurement theory to refer to 
"Hermitian" (presumably, they mean "positive") measurement operators, and 
they cite this usage in the recent book of Wiseman and Milburn [11] (which is 
not in the reference list of DAJ). The book does define the term in this way, but 
a search of the arXiv and Internet for other instances of this or related terms 
(like "minimal disturbance limit") and an inquiry to the prominent Internet 
"bulletin board" sci. physics. research have failed to turn up a single additional 
reference which uses the term in a context similar to DAJ or Wiseman/Milburn. 

They go on to say that they generalize this usage to mean that [Gj , p] = 
0, under the assumption that tjj{g) = ex-p{igGj), and they claim that this 
constitutes a precise definition in DAJ of "minimum disturbance measurement" . 
The only reference I can find to this generalization in DAJ is the following 
sentence just before (7): 

"However, if Vj, [Gj, p] = 0, so the state is minimally disturbed, then 
the context dependence vanishes and ... " . 

I think it would take a mind reader to guess that this is a definition of "minimally 
disturbed" . The syntax would normally be interpreted as a statement that if 
[Gj, p] = 0, then the state is minimally disturbed, where "minimally disturbed" 
had previously been defined (either by a technical definition or implicitly by 
usual English meanings of "minimally" and "disturbed"). That is how I had 
interpreted it. 

Note that even if we accept this as a definition of "minimal disturbance 
measurement" , we woud still need a definition of the "minimal disturbance 
limit [emphasis mine]" which DAJ claims as a hypothesis for (7). There is no 
limit in the condition [Gj,p] = 0, and this condition is automatically satisfied 
for positive measurement operators, so some further "limit" condition (perhaps 
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similar to (llOSp ) is still necessary. Repeated inquiries to the authors asking 
what condition they are using have gone unanswered. 

Note also how Uj(ff) = exp{igGj) now appears as an assumption rather 
than as the conclusion which the previous text of DA J clearly suggests: 

"To find the weak limit of (6) we note that any measurement con- 
text continuously connected to the identity operation can be decom- 

- 1/2 

posed into the form A4 = XJj{g)'Ej (g), where <? is a measurement 
strength parameter and Vj{g) — expfigGj] . . ." 

This essentially says that any real-parametrized set of unitary operators Uj (g) 
with Uj(0) = / is a 1-parameter group of unitary operators (or, more sympa- 
thetically interpreted, can be reparametrized to become a 1-parameter group). 
But the unitary parts of general measurement operators can be completely ar- 
bitaryl There is no reason that for fixed j, the different Vj{g) should even 
commute, as members of a 1-parameter group must. 

Even assuming that this was intended as an assumption, it is clearly an 
extremely strong assumption which would be expected to hold only in unusual 
cases for measurement operators with Uj(g) ^ /. So it seems that the essence 
of DAJ's hypothesis for (7) is that the measurement operators Mj be positive 
(which implies that they are Hermitian). 

However, this is still an interesting hypothesis. The reader may recall that 
the "Hermitian measurement operators" section was unable to establish (7) un- 
der this hypothesis, and strengthening it to assume positive measurement oper- 
ators would not have helped. What looked like a possibly essential obstruction 
to a proof had already appeared. 

In response to my earlier inquiries, DAJ had sent me an attempted proof 
that (6) implies (7) assuming that [Gj,p] — 0. There was a serious gap in it. I 
asked about the gap, and they sent me an expanded proof to fill the gap. But 
the expanded proof was definitely wrong because there is a counterexample to 
one of its crucial steps, and it looked as if the difficulty might well be essential. 

I sent them the counterexample and have received nothing of substance from 
them since. Despite several direct inquiries, they have neither acknowledged 
that the proof they sent me was incorrect nor claimed that they still think they 
can prove that (6) implies (7) for positive measurement operators. 

I want to acknowledge that DAJ does state that contextual weak values are 
not necessarily the traditional weak value: 

"Writing the initial context in (6) to first order in g, we find that 
as (7 — >■ 0, the weak limit generally depends explicitly on {Gj} and 
{ctj} and thus will change depending on how it is measured and how 
the CV are chosen (see also [11])." 

I somehow overlooked this while writing Version 2, perhaps because it is em- 
bedded in the vague and error-ridden paragraphs leading to (7) of which I could 
make no sense. I have changed the title and rewritten the abstract to reflect 
that my Section [9] example that contextual weak values are not necessarily the 
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traditional weak value was essentially already announced in DA J (assuming the 
guess that DAJ's definition of "weak limit" coincides with my (|105|) ). 

Other than that, I have left the body of Version 2 nearly intact. A few typos 
and potentially misleading phrases have been corrected. 

11.1 Origin of the term "minimal disturbance measure- 
ment" 

Lacking any definition in DAJ, or even a reference to a definition, I had guessed 
that the words "minimal disturbance limit" were synonymous, or nearly so, 
with "weak measurement" as used in works on weak measurement such as [7]. 
This turned out to be an incorrect guess, but it seems an almost inevitable 
misunderstanding given the vague way that DAJ is written. I thought that 
DAJ's abstract claimed that contextual values were always the "quantum [i.e., 
traditional] weak value" in the limit of weak measurements. 

The motivation given in the book of Wiseman and Milburn jTl] for the 
term "minimal disturbance measurement" derives from an interesting paper of 
Banaszek p2]. Using one of many reasonable definitions of "closeness" of states, 
he obtains the average over all pure states of the closeness of a postmeasurement 
state to the premeasurement state, and finds that this average is maximized by 
positive measurement operators. 

But for a particular premeasurement state p it is not necessarily true that 
the postmeasurement state MjpMj is closest to p when Uj = J in the polar 
decompositions Mj — UjHj {Uj unitary, Hj positive) Since DAJ is dealing 
with particular premeasurement states (satisfying their "minimal disturbance" 
condition [Gj , p] — 0) their terminology "minimal disturbance limit" seems not 
only potentially misleading, but actually inappropriate in their context. 

11.2 Possible alternate definitions of "minimal disturbance 
limit" 

Consider measurement operators Mj = M.j{g) with polar decompositions M.j{g) = 
\Jj{g)Ej{g) with "fJj{g) unitary and Ej(g) positive. I am now sure that DAJ's 
definition of "minimal disturbance limit" includes the condition [Gj , p] = 0, 
assuming that Vj{g) = cxp{igGj). This is much more restrictive than 

[U,(<?),p]=0 for all 5 (120) 

(which would not assume that \Jj{g) — exp(igGj)), or 

lim[U,(5),p]=0 , (121) 

3->-0 

which would not require that p commute exactly with the unitary parts of the 
measurement operators, but only in the limit g — >■ 0. My original guess was 



[Tl] Exercise 1.28, Section 1.4.2 
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that the definition of DA J would include something like (|12ip . which seems 
much more natural than [Gj, p] = 0, 

Any positive measurement operators (i.e., Vj{g) — I for all g and j) satisfy 
all these definitions, so some further condition must be necessary if "minimal 
disturbance limit" is to have any meaning even remotely suggested by the words 
"minimal disturbance" and "limit" . The "Preliminary summary" Section [6] 
pointed out that in translating from the "weak measurement" point of view to 
DAJ's "contextual value" point of view, a projective measurement which gives 
result j (i.e., a "meter measurement") is made in a composite system 5 ® M 
and then M is traced out to obtain a completely positive map on 5* which sends 
a premeasurement state p to the (unnormalized) postmeasurement state 

J2M,,,pMl , (122) 

i=l 

the probability of this transition being 



Define 



1/2 

(123) 



call these "measurement operators" , and note that they are necessarily positive. 
For the purpose of defining "contextual values" , one needs only the probabilities 
Pj, which can be obtained from the Mj, and so it might seem that the more 
general language of completely positive maps could be replaced by the more 
restrictive language of "measurement operators" adopted by DAJ. If so, one 
could also assume that the measurement operators are positive, and the issue 
of which of the above definitions ( (fT20l) . (fT2T|) or the [Gj,p] = of DAJ) are 
used for "minimal disturbance measurement would become moot. 

However, for the purpose of defining "weak limit" , or "minimal disturbance 
limit" for positive measurement operators, it may make a difference whether one 
uses the doubly indexed form (jl22p or traditional singly indexed measurement 
operators (jl23p . This is because the (unnormalized) postmeasurement state 
when result j is obtained with measurement operators {Mj} is 

M.pM] , (124) 

which is not generally the same as the postmeasurement state (|122p obtained 
from the doubly indexed measurement operators. For example, it is not clear 
that "doubly indexed" analogs of my "minimal disturbance" condition (|105p . 
such as 

r M,-,,(g)/}M],„(,9) . , „ . . , . ,,,,, 

Imi ^-r = p, tor all i,z, and p, (125) 

s-OTr[M,,,(.g)pMt (g)] 
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or 



^ 5Z (ff )/5M]_, (5) = P, for all p. 



(126) 



imply (llOSp or conversely. 

Another possible approach would be to consider the pairs {j, i), j — 1,2, ... , 
i = 1, ... kj, a.s "primitive results" which could be reindexed with a single index 
if desired. However, since all of the results {j, 1), (j, 2), . . . , (j, in the mea- 
surement on S correspond the the single result j in the meter system M, this 
would require the strange assumption that finer measurement distinctions were 
possible for measurements in the original system 5* than in the meter system M. 
In that case, why would one bother with "meter measurements" in M at all? 
And it seems almost a conceptual contradiction that finer measurement results 
{j, i) in S could somehow emerge from the coarser results j in M. It is unclear 
to me how this viewpoint could be related to actual experiments. 

DAJ claims that 

". . . [the] WV [weak value] can be subsumed as a special case in the 
CV [contextual value] formalism" , 

but this seems to me not so obvious, and quite likely false. A convincing proof 
would surely require careful statements of assumptions and a careful definition 
of "minimal disturbance limit" . The converse seems more likely: that WV 
theory subsumes CV theory. 



I was trained as a mathematician and only later developed a serious interest in 
physics. I have often been appalled at the general unreliability of the physics 
literature. Over the years, I have learned not to waste time on papers which 
use undefined symbols or terms, and that claimed results of calculations can be 
trusted only after one has performed them for oneself. 

If I had followed that policy with DAJ, I would have avoided a lot of work 
and aggravation. But I was sucked in by the tantalizing but unfortunately false 
assumption that "minimal disturbance limit" had to have a meaning something 
like the ordinary meaning of the English words composing it, and the probability 
that if it did, DAJ's results might be in contradiction to mine. I submit this 
to the arXiv in the hope that it may save other readers from similarly wasting 
their time. 



12 A possibly definitive counterexample to DAJ's 



Earlier versions of this paper noted a major gap in DAJ's passage from its 
"general conditioned average" (6) to what it calls the "quantum weak value" 



11.3 A final word 
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(7) (which for pure states reduces to the traditional weak value (jlOOp ). Versions 
1 and 2 presented a counterexample to (7) assuming my guess (|105p at the 
meaning of DAJ's undefined "minimal disturbance limit". But, as described in 
the Afterword to Version 3 above, my guess turned out to be wrong. 

DAJ was assuming that the measurements satisfied a slight generalization of 
the definition of "minimally disturbing measurement" given in Section 1.4.2 of 
Wiseman and Milburn's book [TI] o Wiseman and Milburn define a "mini- 
mally disturbing measurement" as one for which the measurement operators are 
positive. I shall use this definition in place of DAJ's slightly more general one 
(as described in their reply to Physical Review Letters (PRL) but not clearly 
given in DAJ) because it is simpler and a counterexample to (7) using Wise- 
man/Milburn's more restrictive definition is also a counterexample to (7) under 
DAJ's more inclusive version. 

However, because there is no limit involved in Wiseman/Milburn's definition 
(nor in the only description of DAJ's definition given in PRL), this still leaves 
DAJ's "minimal disturbance limit" partially undefined. I shall complete the def- 
inition by assuming that the "limit" refers to my original guess (jlOSp . In their 
reply to PRL, the authors of DAJ refer to (llOSp as defining "ideally weak mea- 
surement" . Since I do not know if they make any distinction between "ideally 
weak measurement" and mere "weak measurement"Ell[^ I shall use the simpler 
term "weak measurement" to refer to a measurement satisfying (jl05|) .Our goal 
is to find a set {Mj{g)} of positive measurement operators, contextual values 
aj{g), an initial state p, and a final pure state /, such that (7) does not hold, 
assuming the "weak measurement" condition (|105p . 

We start with equation (|109|) for positive measurement operators Mj, but 
for simplicity we abandon DAJ's baroque notation in which (jlOSp is written 
(e.g., we write Mj in place of Mj). Since a positive operator on a complex 
Hilbert space is automatically Hermitian, we replace all M| in (|109p by Mj and 
explicitly introduce the weak measurement parameter g, obtaining: 

numerator of (6) — 

Tr [Pf J2 a,{l/2){M,{g)M,{g)P + pM,{g)M,{g)}] 

3 

^^These are the same as what are called "minimum disturbance measurements" in the 
Afterword to Version 3; the latter is the term used in DAJ's reply to my "Comment" submitted 
to Physical Review Letters. When I wrote the Afterword, I did not have access to Milburn 
and Wiseman's book 

■^"They do not correspond to what one might guess given only the normal associations of 
the English phrase "minimally disturbing measurements" ; see the Afterword to Version 3 for 
more information. 

^^The authors of DAJ have ignored a direct question about this, as they have ignored all 
my recent correspondence. 

■^■^Our counterexample which assumes what DAJ call "ideally weak measurement" l|105| l will 
also be a counterexample under any weaker assumption, which presumably includes the mere 
weak measurement or "weak limit" assumption of DAJ. It is annoying that one has to guess 
at the meaning of such terms. 
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-Tr [Pf a,i [M,{g), p] M,{g) + M,{g) [p, M,{g)] ) ]. (127) 



As noted after (I105|l . the first term of (|127p yields (after division by the denom- 
inator of (6), (|107p ) the weak value (7), so a counterexample will result if we 
can find positive Mj (g) satisfying ()105p and states p, f with Tr [Pf p] ^ for 
which the limit as g — )■ of the second term of (11271) is nonzero. This second 
term can be rewritten with a double commutator^ 

Tr [Pf J2 ( (g) , p] M, {g) + M, {g)[p, M, {g)] ) ] 
j 

= Tr[PfY^~a,[M,{g),[M,{g),p\]] . (128) 



If we expand Mj{g) in a power series Mj{g) 



(0) 



+ gMf'^ + g^M. 



(2) 



intuitively we would expect the constant term A/? to be a multiple of the identity 



in order to satisfy the weak measurement condition (|105|) . Therefore, if Afj^-* ^ 
0, we would expect the double commutator to be of order g^, at least for some 
states p. Hence to make (|128l) nonzero, it would probably be necessary to 
arrange that otj{g) go to infinity as 1/g^ as 5 — > 0. So, this will be our initial 
goal, and after achieving it, we shall return to analyze (|128p more closely. 

Besides assuring that (|128l) be nonzero in the limit <? — >■ 0, we must define the 
Mj{g) as positive operators such that for some {ctj}, the following two equations 
hold: 

(129) 



m]m.j = V Af2 = /, and 



E 



A 



(130) 



where A is the operator to be "weakly measured". We shall take ^ to be a 
one-dimensional projector on a two-dimensional Hilbert space, represented by 
a matrix 

' 1 




A := 



(131) 



We shall use three measurement operators: 



Mi{g) 
MM 



1/2 + g 






1/2-3 



hUg) 



1/2 -g 
1/2 4 



U32) 



[I-Ml{g)-Mi{g)Y/^ 







v/1/2 - 

Vl/2-2g2 



^•^I am indebted to a private communication from the authors of DAJ for this observation, 
which was part of an attempted proof that (6) imphes (7) in the "minimal disturbance hmit" . 
Though not strictly necessary for the present counterexample, it provides helpful motivation 
and simplifies the calculations. 
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The thing to notice is that we are choosing Mi and M2 smah enough that M3 
is uniquely determined by (|129|) and in such a way that some of the contextual 
values aj[g) can be chosen to be of order 

Writing out (|130|) in components gives two scalar equations in three un- 
knowns: 



{1/2 + gf a^{g) + {1/2 ~gfa2{9) + {1/2 -2g^)aM = 1 
{l/2-gf ai{g) + {1/2 + gfa2{g) + {1/2 ~2g^)a:i{g) = 



(133) 



Since we want to make at least one of the aj of order l/g"^, let us attempt 
to define ai{g) := \/g^ and solve the remaining system for 02 and a^. The 
remaining system is consistent, and the full solution is: 



ttiCg) 



1 



"2(5) 



9^ 



2g 



, "3(9) 



_ 12g2 + - 4 

4g2(4g2 - 1) 



(134) 



(This has been checked both by hand and by a computer algebra program.) 

To see with minimal calculation that this will produce a counterexample, 
note that for 

Pii P12 
P21 P22 



(135) 



and for any diagonal matrix 



D 



di 

d2 



[D,p] 



{d2 - difp2i 



In particular for j = 1,2, 

[M,{g), [M,{g),p\]^ 



{di - d2)pi2 

{d2 ~ di)p2i 





and 
{di - d2fpi2 



43^21 











and since M^{g) is a multiple of the identity, [M^{g),p\ — 0. Hence (|128p be- 
comes: 



(l/2)Tr [PfY^-a,[M,{g), [M,{g),p\ 



-Tr [Pf 



4pi2 
4p2i 



0{g). 



(136) 
(137) 



The trace is easily seen to be nonzero for pi2 7^ and appropriate Pf (see 
below) . 
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Combining (|106p . (|107p . and (|128p . gives the weak limit of (6) for this exam- 
ple as expression (7) plus a term which does not always vanish. For example, 
for a norm 1 vector / (/i, /2) 

; f^A^ TT[Pf{A,p}] , -8jR(/*/ip2i) 

weak limit of (6 =————— + -1-7-12 .rysoff*f VTTT^ ■ (^^^ 

2Tr [P/p] l/irpii + 23fi(/2*/iP2i) + |/2rp22 



The first term is DAJ's expression (7) written in our notation. 

:pression ( 

(139) 



difference between weak limit of (6) and expression (7) — 
-85R(/2*/ip2i) 



|/l|2pil+25?(/2*/lP2l) + |/2Pp22' 

Some may feel uneasy about this example because M'i{g) is a multiple of 
the identity, which may seem suspiciously trivially The above measurement 
operators were chosen to give a simple example with minimal calculation. Less 
trivial examples should be obtainable by appropriately perturbing the above 
measurement operators, e.g., adding to Mi or M2 terms which are of order 
or greater, so that in the limit g — > they will not affect the anomalous term in 
p27p . I have not pursued this because the calculations rapidly get messy, and 
it is not clear what would be learned from them. 



13 Appendix: A mathematician's view of the 
Moore-Penrose pseudoinverse 

It took me perhaps half an hour to translate DAJ's complicated description of 
the Moore-Penrose pseudoinverse into something which I could easily visualize. 
Such descriptions are not uncommon in the physics literature. This appendix 
is written as a service to those who might be interested in how mathematicians 
think about such things. 

Let A be an operator from a Hilbert space 5 to a possibly different Hilbert 
space K . For simplicity, I will assume below that all Hilbert spaces mentioned 
are finite dimensional, but nearly all of what will be said also works for infinite 
dimensional spaces, with appropriate changes of language and qualifications. 
Many people feel more comfortable thinking of A as a matrix, but mathemati- 
cians have learned that unless the problem at hand specifically involves matrices, 
it is usually easier and more insightful not to. 

There are three important subspaces of associated with a given operator A. 
Its nullspace, denoted Null(A) is defined as the set of all vectors n € S such 
that An = 0. Its range, denoted Range(yl) is defined as the set of all vectors 
r in K which are of the form r — As for some vector s e 5. Perhaps less well 

■^*This example may be somewhat artificial, but it is not physically unrealistic. One could 
realize this situation by first performing a classical experiment with two outcomes called A 
and B, each occuring with a definite probability. If A occurs, perform a quantum experiment 
with outcomes 1 and 2; if B occurs, do nothing further. Outcome B would correspond to 
outcome 3 in a quantum experiment with measurement operators Mi , M2 , M3 . 
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known is the initial space of A, denoted In(A), which is defined as the orthogonal 
complement of the nuUspace: 

In(yl) := Null(^)^ :== {s e S\ {s,n) ^ for all n £ Nun(A)}. 

The restriction of A to its initial space, denoted yl|In(yl), maps In(^) onto 
Range(A), and this restriction is always 1:1. This is because if ^|In(^) : 
In(^) — >■ K were not 1:1, it would have a nontrivial nullspace, but In(yl) is, 
by definition, orthogonal to Null(^). 

This suggests that it might be profitable to temporarily change our object 
of study from the original operator A : S ^ K to a. new operator ^|In(A) : 
In(^) — > K. In accordance with this broader view, consider an operator C : 
H ^ K which maps a Hilbert space H into a Hilbert space K. (We do not 
assume that C maps H onto K.) 

An operator B : K ^ H is called a left inverse of C : — > X if BC = Ih, 
where Ih denotes the identity operator on H. It is easy to describe how to 
construct all left inverses B for C. Any left inverse is uniquely determined on 
the range of C, i.e., for k = Ch with h E H, hy the equation 

Bk = BCh = Inh = h . 

If given k E K there is more than one h E H with Ch — k (i.e., if C is not 1:1), 
then C has no left inverse. 

That uniquely defines the left inverse B on Range(C) when C is 1:1. B can 
be defined any way we choose (subject to linearity) on Range(C)^. Since K is 
the direct sum of Range(C) and Range(C)^, B is uniquely defined on all of K 
once it is defined on Range(C) and Rangc(C)-'-. 

The simplest way to define B on Range(C)^ is to make it identically zero 
there, and this leads us to the definition of the Moore-Penrose pseudoinverse, 
denoted A'^, for an operator A : S ^ K. We just take H := In(^) and apply 
the preceding discussion. Define : K In (A) C 5* to be the unique left 
inverse for A|In(A) which is zero on Range(A)^. If one works through the 
complicated definition of the Moore-Penrose pseudoinverse given in DAJ, one 
sees that this is how it is defined, at least for the case in which the eigenspaces 
of AA'^ corresponding to nonzero eigenvalues are one-dimensional 1^ 

Most mathematicians probably would define the Moore-Penrose pseudoin- 
verse v4+ of A as the unique left inverse for ^|In(74) : In(^) K which anni- 
hilates the orthogonal complement of Range(74). It is a genuine left inverse for 
^|In(v4) : In(^) — ^ K, but not ioi A : S -i- K; the latter has no left inverse 

^^For the general case, the prescription of DAJ is either incomplete or incorrect, depending 
on the interpretation. The question is the meaning of their definition of U as the orthogonal 
matrix "composed of the eigenvectors of FF^". I would guess that this would mean that the 
columns of U are eigenvectors of FF^ (which still only defines U up to multiplying its columns 
by scalars when the eigenvalues of FF^ all have multiplicity 1, with further ambiguity possible 
in more general cases), but if that is what they meant, then their construction is incorrect. A 
simple counterexample is a unitary matrix F which is not diagonal, in which case U, V, and 
S could all be diagonal under the above interpretation. 
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unless In(A) = S because otherwise A has a nontrivial nullspace and so is not 
1:1. 

That is concise and easy to visualize. In addition, it generalizes immediately 

to operators A on an infinite-dimensional Hilbert space space (without the com- 
pactness hypothesis for A mentioned in footnote 12 of DA J), in which case the 
pseudoinverse will be a so-called "closed" , possibly unbounded, operator. This 
pscudoinverse will be bounded if and only if is an isolated point of the spec- 
trum of A. (Even if A is compact, this pseudoinverse may be unbounded.) 

If one is given ^ as a matrix, the question of how to write down the Moore- 
Pcnrose pseudoinverse as a matrix naturally arises. This is a computational, 
rather than conceptual, question which is a bit messier, and that is where the 
singular value decomposition may come in (though one can avoid it by using 
the simpler polar decomposition). If one is dealing with matrices, one may be 
forced into messy matrix descriptions, but otherwise it is usually profitable to 
avoid them. 
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